NSF PAR Search | NSF Public Access Repository

Class-Imbalanced Learning on Graphs: A Survey

https://doi.org/10.1145/3718734

Ma, Yihong; Tian, Yijun; Moniz, Nuno; Chawla, Nitesh V (August 2025, ACM Computing Surveys)

Rapid advancement in machine learning is increasing the demand for effective graph data analysis. However, real-world graph data often exhibits class imbalance, leading to poor performance of standard machine learning models on underrepresented classes. To address this, Class-Imbalanced Learning on Graphs (CILG) has emerged as a promising solution that combines graph representation learning and class-imbalanced learning. This survey provides a comprehensive understanding of CILG’s current state-of-the-art, establishing the first systematic taxonomy of existing work and its connections to traditional imbalanced learning. We critically analyze recent advances and discuss key open problems. A continuously updated reading list of relevant articles and code implementations is available at https://github.com/yihongma/CILG-Papers.
Free, publicly-accessible full text available August 31, 2026
Do multimodal large language models understand welding?

https://doi.org/10.1016/j.inffus.2025.103121

Khvatskii, Grigorii; Lee, Yong Suk; Angst, Corey; Gibbs, Maria; Landers, Robert; Chawla, Nitesh V (August 2025, Information Fusion)

This paper examines the performance of Multimodal LLMs (MLLMs) in skilled production work, with a focus on welding. Using a novel data set of real-world and online weld images, annotated by a domain expert, we evaluate the performance of two state-of-the-art MLLMs in assessing weld acceptability across three contexts: RV & Marine, Aeronautical, and Farming. While both models perform better on online images, likely due to prior exposure or memorization, they also perform relatively well on unseen, real-world weld images. Additionally, we introduce WeldPrompt, a prompting strategy that combines Chain-of-Thought generation with in-context learning to mitigate hallucinations and improve reasoning. WeldPrompt improves model recall in certain contexts but exhibits inconsistent performance across others. These results underscore the limitations and potential of MLLMs in high-stakes technical domains and highlight the importance of fine-tuning, domain-specific data, and more sophisticated prompting strategies to improve model reliability. The study opens avenues for further research into multimodal learning in industry applications.
Free, publicly-accessible full text available August 1, 2026
MOPI-HFRS: A Multi-objective Personalized Health-aware Food Recommendation System with LLM-enhanced Interpretation

https://doi.org/10.1145/3690624.3709382

Zhang, Zheyuan; Wang, Zehong; Ma, Tianyi; Taneja, Varun Sameer; Nelson, Sofia; Le, Nhi_Ha Lan; Murugesan, Keerthiram; Ju, Mingxuan; Chawla, Nitesh V; Zhang, Chuxu; et al (July 2025, ACM)

Free, publicly-accessible full text available July 20, 2026
Pure Message Passing Can Estimate Common Neighbor for Link Prediction

Dong, Kaiwen; Guo, Zhichun; Chawla, Nitesh V (December 2024, NeurIPS)

Message Passing Neural Networks (MPNNs) have emerged as the de facto standard in graph representation learning. However, when it comes to link prediction, they are not always superior to simple heuristics such as Common Neighbor (CN). This discrepancy stems from a fundamental limitation: while MPNNs excel in node-level representation, they struggle to encode the joint structural features essential to link prediction, such as CN. To bridge this gap, we posit that, by harnessing the orthogonality of input vectors, pure message-passing can indeed capture joint structural features. Specifically, we study the proficiency of MPNNs in approximating CN heuristics. Based on our findings, we introduce the Message Passing Link Predictor (MPLP), a novel link prediction model. MPLP taps into quasiorthogonal vectors to estimate link-level structural features, all while preserving the node-level complexities. We conduct experiments on benchmark datasets from various domains, where our method consistently outperforms the baseline methods, establishing a new state of the art.
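
The orthogonality idea in the abstract above can be illustrated with a small NumPy sketch. This is not the authors' MPLP implementation; it only shows, under simple assumptions, why one round of sum aggregation over nearly orthogonal random node vectors lets an inner product approximate the Common Neighbor count. The function name `estimate_common_neighbors`, the vector dimension, and the toy graph are all hypothetical choices made for illustration.

```python
import numpy as np

def estimate_common_neighbors(adj, dim=4096, seed=0):
    """Estimate common-neighbor counts for all node pairs (illustrative only)."""
    rng = np.random.default_rng(seed)
    n = adj.shape[0]
    # Random sign vectors with unit norm: x_i . x_i == 1 exactly, while
    # x_i . x_j for i != j is close to 0 when dim is large (quasi-orthogonal).
    x = rng.choice([-1.0, 1.0], size=(n, dim)) / np.sqrt(dim)
    # One round of sum-aggregation message passing: h_u = sum of x_i over neighbors i of u.
    h = adj @ x
    # h_u . h_v = (number of shared neighbors) + small cross-term noise.
    return h @ h.T

# Hypothetical toy graph with 5 nodes (undirected, no self-loops).
adj = np.zeros((5, 5))
for u, v in [(0, 2), (0, 3), (1, 2), (1, 3), (1, 4)]:
    adj[u, v] = adj[v, u] = 1.0

est = estimate_common_neighbors(adj)
exact = adj @ adj  # entry (u, v) is the exact common-neighbor count
print(np.round(est, 2))
print(exact)
```

The estimate is noisy, with an error that shrinks as `dim` grows; the sketch trades exactness for the ability to compute the quantity with node-level message passing alone.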
Full Text Available
Graph Cross Supervised Learning via Generalized Knowledge

https://doi.org/10.1145/3637528.3671830

Yuan, Xiangchi; Tian, Yijun; Zhang, Chunhui; Ye, Yanfang; Chawla, Nitesh V; Zhang, Chuxu (October 2024, ACM)

Full Text Available
Application of Large Language Models in Chemistry Reaction Data Extraction and Cleaning

https://doi.org/10.1145/3627673.3679874

Huang, Xiaobao; Surve, Mihir; Liu, Yuhan; Luo, Tengfei; Wiest, Olaf; Zhang, Xiangliang; Chawla, Nitesh V (October 2024, ACM)

Chemical reaction data has existed, and still largely exists, in unstructured forms. Curating such information into datasets suitable for tasks such as yield and reaction outcome prediction is impractical to do manually and cannot be automated through programmatic means alone. Large language models (LLMs) have emerged as potent tools with remarkable capabilities in processing textual information, and could therefore be extremely useful in automating this process. To address the challenge of unstructured data, we manually curated a dataset of structured chemical reaction data to fine-tune and evaluate LLMs. We propose a paradigm that leverages prompt-tuning, fine-tuning techniques, and a verifier to check the extracted information. We evaluate the capabilities of various LLMs, including LLAMA-2 and GPT models with different parameter counts, on the data extraction task. Our results show that prompt tuning of GPT-4 yields the best accuracy and evaluation results. However, fine-tuning LLAMA-2 models with hundreds of samples does enable them to organize scientific material according to user-defined schemas more effectively. This workflow shows an adaptable approach for chemical reaction data extraction but also highlights the challenges associated with nuance in chemical information. We open-sourced our code on GitHub.
Full Text Available
NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning

https://doi.org/10.18653/v1/2025.acl-long.296

Zhang, Zheyuan; Li, Yiyang; Le, Nhi_Ha Lan; Wang, Zehong; Ma, Tianyi; Galassi, Vincent; Murugesan, Keerthiram; Moniz, Nuno; Geyer, Werner; Chawla, Nitesh V; et al (January 2025, Association for Computational Linguistics)

Full Text Available
Diet-ODIN: A Novel Framework for Opioid Misuse Detection with Interpretable Dietary Patterns

https://doi.org/10.1145/3637528.3671587

Zhang, Zheyuan; Wang, Zehong; Hou, Shifu; Hall, Evan; Bachman, Landon; White, Jasmine; Galassi, Vincent; Chawla, Nitesh V; Zhang, Chuxu; Ye, Yanfang (October 2024, ACM)

Full Text Available
Are we Making Much Progress? Revisiting Chemical Reaction Yield Prediction from an Imbalanced Regression Perspective

https://doi.org/10.1145/3589335.3651470

Ma, Yihong; Huang, Xiaobao; Nan, Bozhao; Moniz, Nuno; Zhang, Xiangliang; Wiest, Olaf; Chawla, Nitesh V (May 2024, ACM)

Full Text Available
Boosting Graph Neural Networks via Adaptive Knowledge Distillation

https://doi.org/10.1609/aaai.v37i6.25944

Guo, Zhichun; Zhang, Chunhui; Fan, Yujie; Tian, Yijun; Zhang, Chuxu; Chawla, Nitesh V. (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

Graph neural networks (GNNs) have shown remarkable performance on diverse graph mining tasks. While sharing the same message passing framework, our study shows that different GNNs learn distinct knowledge from the same graph. This implies potential performance improvement by distilling the complementary knowledge from multiple models. However, knowledge distillation (KD) transfers knowledge from high-capacity teachers to a lightweight student, which deviates from our scenario: GNNs are often shallow. To transfer knowledge effectively, we need to tackle two challenges: how to transfer knowledge from compact teachers to a student with the same capacity, and how to exploit the student GNN's own learning ability. In this paper, we propose a novel adaptive KD framework, called BGNN, which sequentially transfers knowledge from multiple GNNs into a student GNN. We also introduce an adaptive temperature module and a weight boosting module. These modules guide the student to the appropriate knowledge for effective learning. Extensive experiments have demonstrated the effectiveness of BGNN. In particular, we achieve up to 3.05% improvement for node classification and 6.35% improvement for graph classification over vanilla GNNs.
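
For readers unfamiliar with the knowledge distillation objective the abstract above builds on, the following is a minimal sketch of the standard temperature-scaled distillation loss. It is not BGNN itself: the temperature here is a fixed assumed value rather than the adaptive module described by the authors, and the function names and example logits are hypothetical.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Row-wise softmax of logits divided by a temperature."""
    z = logits / temperature
    z = z - z.max(axis=-1, keepdims=True)  # shift for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Mean KL(teacher || student) on temperature-softened class distributions."""
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = np.sum(p_teacher * (np.log(p_teacher) - np.log(p_student)), axis=-1)
    # The T^2 factor is the usual convention for keeping gradient scale comparable.
    return float(np.mean(kl) * temperature ** 2)

# Hypothetical logits for 2 nodes and 3 classes.
teacher = np.array([[2.0, 0.5, -1.0], [0.1, 1.5, 0.3]])
student = np.array([[1.0, 1.0, 0.0], [0.0, 0.5, 0.5]])

loss_same = distillation_loss(teacher, teacher)
loss_diff = distillation_loss(student, teacher)
print(loss_same, loss_diff)
```

A higher temperature flattens both distributions, so the student is pushed to match the teacher's relative preferences among non-top classes as well as its top prediction.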
Full Text Available
